Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 20 de 29
Filter
1.
Journal of Laboratory and Precision Medicine ; 6(January) (no pagination), 2021.
Article in English | EMBASE | ID: covidwho-2269215
2.
Mining Informational and Analytical Bulletin ; 64(1):56-64, 2023.
Article in English | Scopus | ID: covidwho-2248983

ABSTRACT

This study aimed to get a better understanding of molecular epidemiology and genetic variation in the spike glycoprotein as a key viral component involved in viral entrance into host cells and as a potential vaccination target. Three Iraqi SARSCoV- 2 strains were investigated using whole-genome sequencing, with two of them clustering into the 20A (GH) clade, and the remaining strain is clustered in 20E (GV) clade, belonging to the B.1.36.1 and B.1.177.80 lineage, respectively. Wholegenome sequencing of the viral RNA samples revealed nine sporadic nonsynonymous uncommon mutations with frequency ranged from 0.00 to 0.19%. The ORF1ab, ORF1a, ORF3a, S, N, intergenic, ORF7 and ORF8 areas have seen the most changes. Furthermore, in all of our sequences, we discovered a D614G (aspartic acid to glycine) mutation in spike protein that co-occurred with an NSP12 P323L (viral RNA-dependent RNA polymerase) mutation. The findings point to several viral introductions in Iraq and provide new genetic information on SARSCoV- 2 at the worldwide level. Pathogenesis, diagnostics and vaccine development require information such as SNPs and mutations. © 2023 Publishing house Mining book. All rights reserved.

3.
Int Immunopharmacol ; 112: 109224, 2022 Nov.
Article in English | MEDLINE | ID: covidwho-2076214

ABSTRACT

In the worrisome scenarios of various waves of SARS-CoV-2 pandemic, a comprehensive bioinformatics pipeline is essential to analyse the virus genomes in order to understand its evolution, thereby identifying mutations as signature SNPs, conserved regions and subsequently to design epitope based synthetic vaccine. We have thus performed multiple sequence alignment of 4996 Indian SARS-CoV-2 genomes as a case study using MAFFT followed by phylogenetic analysis using Nextstrain to identify virus clades. Furthermore, based on the entropy of each genomic coordinate of the aligned sequences, conserved regions are identified. After refinement of the conserved regions, based on its length, one conserved region is identified for which the primers and probes are reported for virus detection. The refined conserved regions are also used to identify T-cell and B-cell epitopes along with their immunogenic and antigenic scores. Such scores are used for selecting the most immunogenic and antigenic epitopes. By executing this pipeline, 40 unique signature SNPs are identified resulting in 23 non-synonymous signature SNPs which provide 28 amino acid changes in protein. On the other hand, 12 conserved regions are selected based on refinement criteria out of which one is selected as the potential target for virus detection. Additionally, 22 MHC-I and 21 MHC-II restricted T-cell epitopes with 10 unique HLA alleles each and 17 B-cell epitopes are obtained for 12 conserved regions. All the results are validated both quantitatively and qualitatively which show that from genetic variability to synthetic vaccine design, the proposed pipeline can be used effectively to combat SARS-CoV-2.


Subject(s)
COVID-19 , Viral Vaccines , Humans , SARS-CoV-2/genetics , Epitopes, B-Lymphocyte , Epitopes, T-Lymphocyte , COVID-19 Vaccines/genetics , Computational Biology , Phylogeny , COVID-19/prevention & control , Immunogenicity, Vaccine , Vaccines, Synthetic/genetics , Amino Acids
4.
Microorganisms ; 10(10)2022 Oct 12.
Article in English | MEDLINE | ID: covidwho-2071646

ABSTRACT

Pathogens including viruses evolve in tandem with diversity in their animal and human hosts. For SARS-coV2, the focus is generally for understanding such coevolution on the virus spike protein, since it demonstrates high mutation rates compared to other genome regions, particularly in the receptor-binding domain (RBD). Viral sequences of the SARS-coV2 19B (S) clade and variants of concern from different continents were investigated, with a focus on the A.29 lineage, which presented with different mutational patterns within the 19B (S) lineages in order to learn more about how SARS-coV2 may have evolved and adapted to widely diverse populations globally. Results indicated that SARS-coV2 went through evolutionary constrains and intense selective pressure, particularly in Africa. This was manifested in a departure from neutrality with excess nonsynonymous mutations and a negative Tajima D consistent with rapid expansion and directional selection as well as deletion and deletion-frameshifts in the N-terminal domain (NTD region) of the spike protein. In conclusion, we hypothesize that viral transmission during epidemics through populations of diverse genomic structures and marked complexity may be a significant factor for the virus to acquire distinct patterns of mutations within these populations in order to ensure its survival and fitness, explaining the emergence of novel variants and strains.

5.
Trends in Sciences ; 19(17), 2022.
Article in English | Scopus | ID: covidwho-2057198

ABSTRACT

SARS-CoV-2 has very recently posed a potential threat to humanity due to its very rapid rate of mutations and repairing mechanism. The spread of this virus is considered to have occurred in Wuhan, China in December 2019. Characterized by high rates of transmission, the virus is constantly evolving towards attaining higher rates of stability and transmissibility through acquiring mutations in its genome. Therefore, this study aims to analyse the mutational profiles of SARS-CoV-2 isolates. Analysis of the mutational profiles in individual SARS-CoV-2 proteins will allow us to look into the rates of mutations associated with each protein. Frequently mutated residues have been identified in this research by aligning 688 SARS-CoV-2 nucleotide sequences, which were downloaded from NCBI (National Center For Biotechnology Information) repository. Further, mutational frequencies of these mutated residues have been studied, which is instrumental in identifying the proteins that are resistant to changes, as well as the ones that have a greater proclivity towards incorporating mutations. © 2022, Walailak University. All rights reserved.

6.
Biotechnol Genet Eng Rev ; : 1-19, 2022 Aug 10.
Article in English | MEDLINE | ID: covidwho-1984697

ABSTRACT

The number of studies and reviews conducted for the Carboxylesterase gene is limited in comparison with other enzymes. Carboxylesterase (CES) gene or human carboxylesterases (hCES) is a multigene protein belonging to the α/ß-hydrolase family. Over the last decade, two major carboxylesterases (CES1 and CES2), located at 16q13-q22.1 on human chromosome 16 have been extensively studied as important mediators in the metabolism of a wide range of substrates. hCES1 is the most widely expressed enzyme in humans, and it is found in the liver. In this review, details regarding CES1 substrates include both inducers (e.g. Rifampicin) and inhibitors (e.g. Enalapril, Diltiazem, Simvastatin) and different types of hCES1 polymorphisms (nsSNPs) such as rs2244613 and rs71647871. along with their effects on various CES1 substrates were documented. Few instances where the presence of nsSNPs exerted a positive influence on certain substrates which are hydrolyzed via hCES1, such as anti-platelets like Clopidogrel when co-administered with other medications such as angiotensin-converting enzyme (ACE) inhibitors were also recorded. Remdesivir, an ester prodrug is widely used for the treatment of COVID-19, being a CES substrate, it is a potent inhibitor of CES2 and is hydrolyzed via CES1. The details provided in this review could give a clear-cut idea or information that could be used for further studies regarding the safety and efficacy of CES1 substrate.

7.
Virus Res ; 315: 198765, 2022 07 02.
Article in English | MEDLINE | ID: covidwho-1768587

ABSTRACT

BACKGROUND: Emergence of new variant of SARS-CoV-2, namely omicron, has posed a global concern because of its high rate of transmissibility and mutations in its genome. Researchers worldwide are trying to understand the evolution and emergence of such variants to understand the mutational cascade events. METHODS: We have considered all omicron genomes (n = 302 genomes) available till 2nd December 2021 in the public repository of GISAID along with representatives of variants of concern (VOC), i.e., alpha, beta, gamma, delta, and omicron; variant of interest (VOI) mu and lambda; and variant under monitoring (VUM). Whole genome-based phylogeny and mutational analysis were performed to understand the evolution of SARS CoV-2 leading to emergence of omicron variant. RESULTS: Whole genome-based phylogeny depicted two phylogroups (PG-I and PG-II) forming variant specific clades except for gamma and VUM GH. Mutational analysis detected 18,261 mutations in the omicron variant, majority of which were non-synonymous mutations in spike (A67, T547K, D614G, H655Y, N679K, P681H, D796Y, N856K, Q954H), followed by RNA dependent RNA polymerase (rdrp) (A1892T, I189V, P314L, K38R, T492I, V57V), ORF6 (M19M) and nucleocapsid protein (RG203KR). CONCLUSION: Delta and omicron have evolutionary diverged into distinct phylogroups and do not share a common ancestry. While, omicron shares common ancestry with VOI lambda and its evolution is mainly derived by the non-synonymous mutations.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , Mutation , SARS-CoV-2/genetics , Spike Glycoprotein, Coronavirus/genetics , Spike Glycoprotein, Coronavirus/metabolism
8.
Saudi J Biol Sci ; 29(5): 3494-3501, 2022 May.
Article in English | MEDLINE | ID: covidwho-1709574

ABSTRACT

In-silico studies on SARS-CoV-2 genome are considered important to identify the significant pattern of variations and its possible effects on the structural and functional characteristics of the virus. The current study determined such genetic variations and their possible impact among SARS-CoV-2 variants isolated in India. A total of 546 SARS-CoV-2 genomic sequences (India) were retrieved from the gene bank (NCBI) and subjected to alignment against the Wuhan variant (NC_045512.2), the corresponding amino acid changes were analyzed using NCBI Protein-BLAST. These 546 variants revealed 841 mutations; most of these were non-synonymous 464/841 (55.1%), there was no identical variant compared to the original strain. All genes; coding and non-coding showed nucleotide changes, most of the structural genes showed frequent nonsynonymous mutations. The most affected genes were ORF1a/b followed by the S gene which showed 515/841 (61.2%) and 120/841 (14.3%) mutations, respectively. The most frequent non-synonymous mutation 486/546 (89.01%) occurred in the S gene (structural gene) at position 23,403 where A changed to G leading to the replacement of aspartic acid by glycine in position (D614G). Interestingly, four variants also showed deletion. The variants MT800923 and MT800925 showed 12 consecutive nucleotide deletion in position 21982-21993 resulting in 4 consecutive amino acid deletions that were leucine, glycine, valine, and tyrosine in positions 141, 142, 143, and 144 respectively. The present study exhibited a higher mutations rate per variant compared to other studies carried out in India.

9.
Virus Res ; 308: 198642, 2022 01 15.
Article in English | MEDLINE | ID: covidwho-1525982

ABSTRACT

BACKGROUND: COVID-19 has posed unforeseen circumstances and throttled major economies worldwide. India has witnessed two waves affecting around 31 million people representing 16% of the cases globally. To date, the epidemic waves have not been comprehensively investigated to understand pandemic progress in India. OBJECTIVE: Here, we aim for pan Indian cross-sectional evolutionary analysis since inception of SARS-CoV-2. METHODS: High quality genomes, along with their collection date till 26th July 2021, were downloaded. Whole genome-based phylogeny was obtained. Further, the mutational analysis was performed using SARS-CoV-2 first reported from Wuhan (NC_045512.2) as reference. RESULTS: Based on reported cases and mutation rates, we could divide the Indian epidemic into seven phases. The average mutation rate for the pre-first wave was <11, which elevated to 17 in the first wave and doubled in the second wave (∼34). In accordance with mutation rate, VOCs and VOIs started appearing in the first wave (1.5%), which dominated the second (∼96%) and post-second wave (100%). Nation-wide mutational analysis depicted >0.5 million mutation events with four major mutations in >19,300 genomes, including two mutations in coding (spike (D614G), and NSP 12b (P314L) of rdrp), one silent mutation (NSP3 F106F) and one extragenic mutation (5' UTR 241). CONCLUSION: Whole genome-based phylogeny could demarcate post-first wave isolates from previous ones by point of diversification leading to incidences of VOCs and VOIs in India. Such analysis is crucial in the timely management of pandemic.


Subject(s)
COVID-19/virology , Genome, Viral , Phylogeny , SARS-CoV-2 , 5' Untranslated Regions , Cross-Sectional Studies , Epidemics , Genomics , Humans , India/epidemiology , Mutation , SARS-CoV-2/genetics , Spike Glycoprotein, Coronavirus/genetics
10.
Methods ; 203: 282-296, 2022 07.
Article in English | MEDLINE | ID: covidwho-1415845

ABSTRACT

Since the emergence of SARS-CoV-2 in Wuhan, China more than a year ago, it has spread across the world in a very short span of time. Although, different forms of vaccines are being rolled out for vaccination programs around the globe, the mutation of the virus is still a cause of concern among the research communities. Hence, it is important to study the constantly evolving virus and its strains in order to provide a much more stable form of cure. This fact motivated us to conduct this research where we have initially carried out multiple sequence alignment of 15359 and 3033 global dataset without Indian and the dataset of exclusive Indian SARS-CoV-2 genomes respectively, using MAFFT. Subsequently, phylogenetic analyses are performed using Nextstrain to identify virus clades. Consequently, the virus strains are found to be distributed among 5 major clades or clusters viz. 19A, 19B, 20A, 20B and 20C. Thereafter, mutation points as SNPs are identified in each clade. Henceforth, from each clade top 10 signature SNPs are identified based on their frequency i.e. number of occurrences in the virus genome. As a result, 50 such signature SNPs are individually identified for global dataset without Indian and dataset of exclusive Indian SARS-CoV-2 genomes respectively. Out of each 50 signature SNPs, 39 and 41 unique SNPs are identified among which 25 non-synonymous signature SNPs (out of 39) resulted in 30 amino acid changes in protein while 27 changes in amino acid are identified from 22 non-synonymous signature SNPs (out of 41). These 30 and 27 amino acid changes for the non-synonymous signature SNPs are visualised in their respective protein structure as well. Finally, in order to judge the characteristics of the identified clades, the non-synonymous signature SNPs are considered to evaluate the changes in proteins as biological functions with the sequences using PROVEAN and PolyPhen-2 while I-Mutant 2.0 is used to evaluate their structural stability. As a consequence, for global dataset without Indian sequences, G251V in ORF3a in clade 19A, F308Y and G196V in NSP4 and ORF3a in 19B are the unique amino acid changes which are responsible for defining each clade as they are all deleterious and unstable. Such changes which are common for both global dataset without Indian and dataset of exclusive Indian sequences are R203M in Nucleocapsid for 20B, T85I and Q57H in NSP2 and ORF3a respectively for 20C while for exclusive Indian sequences such unique changes are A97V in RdRp, G339S and G339C in NSP2 in 19A and Q57H in ORF3a in 20A.


Subject(s)
COVID-19 , SARS-CoV-2 , Amino Acids , COVID-19/epidemiology , COVID-19/genetics , Genome, Viral , Humans , Mutation , Phylogeny , Polymorphism, Single Nucleotide , SARS-CoV-2/genetics
11.
Gene Rep ; 25: 101044, 2021 Dec.
Article in English | MEDLINE | ID: covidwho-1385601

ABSTRACT

SARS-CoV-2 is mutating and creating divergent variants by altering the composition of essential constituent proteins. Pharmacologically, it is crucial to understand the diverse mechanism of mutations for stable vaccine or anti-viral drug design. Our current study concentrates on all the constituent proteins of 469 SARS-CoV-2 genome samples, derived from Indian patients. However, the study may easily be extended to the samples across the globe. We perform clustering analysis towards identifying unique variants in each of the SARS-CoV-2 proteins. A total of 536 mutated positions within the coding regions of SARS-CoV-2 proteins are detected among the identified variants from Indian isolates. We quantify mutations by focusing on the unique variants of each SARS-CoV-2 protein. We report the average number of mutation per variant, percentage of mutated positions, synonymous and non-synonymous mutations, mutations occurring in three codon positions and so on. Our study reveals the most susceptible six (06) proteins, which are ORF1ab, Spike (S), Nucleocapsid (N), ORF3a, ORF7a, and ORF8. Several non-synonymous substitutions are observed to be unique in different SARS-CoV-2 proteins. A total of 57 possible deleterious amino acid substitutions are predicted, which may impact on the protein functions. Several mutations show a large decrease in protein stability and are observed in putative functional domains of the proteins that might have some role in disease pathogenesis. We observe a good number of physicochemical property change during above deleterious substitutions.

12.
Comput Biol Med ; 135: 104654, 2021 08.
Article in English | MEDLINE | ID: covidwho-1313022

ABSTRACT

COVID-19 is an infectious and pathogenic viral disease caused by SARS-CoV-2 that leads to septic shock, coagulation dysfunction, and acute respiratory distress syndrome. The spreading rate of SARS-CoV-2 is higher than MERS-CoV and SARS-CoV. The receptor-binding domain (RBD) of the Spike-protein (S-protein) interacts with the human cells through the host angiotensin-converting enzyme 2 (ACE2) receptor. However, the molecular mechanism of pathological mutations of S-protein is still unclear. In this perspective, we investigated the impact of mutations in the S-protein and their interaction with the ACE2 receptor for SAR-CoV-2 viral infection. We examined the stability of pathological nonsynonymous mutations in the S-protein, and the binding behavior of the ACE2 receptor with the S-protein upon nonsynonymous mutations using the molecular docking and MM_GBSA approaches. Using the extensive bioinformatics pipeline, we screened the destabilizing (L8V, L8W, L18F, Y145H, M153T, F157S, G476S, L611F, A879S, C1247F, and C1254F) and stabilizing (H49Y, S50L, N501Y, D614G, A845V, and P1143L) nonsynonymous mutations in the S-protein. The docking and binding free energy (ddG) scores revealed that the stabilizing nonsynonymous mutations show increased interaction between the S-protein and the ACE2 receptor compared to native and destabilizing S-proteins and that they may have been responsible for the virulent high level. Further, the molecular dynamics simulation (MDS) approach reveals the structural transition of mutants (N501Y and D614G) S-protein. These insights might help researchers to understand the pathological mechanisms of the S-protein and provide clues regarding mutations in viral infection and disease propagation. Further, it helps researchers to develop an efficient treatment approach against this SARS-CoV-2 pandemic.


Subject(s)
COVID-19 , SARS-CoV-2 , Angiotensin-Converting Enzyme 2 , Humans , Molecular Docking Simulation , Molecular Dynamics Simulation , Mutation , Peptidyl-Dipeptidase A/genetics , Protein Binding , Spike Glycoprotein, Coronavirus/genetics
13.
Genomics Inform ; 19(2): e15, 2021 Jun.
Article in English | MEDLINE | ID: covidwho-1311444

ABSTRACT

The direction of evolution can estimate based on the variation among nonsynonymous to synonymous substitution. The simulative study investigated the nucleotide sequence of closely related strains of respiratory syndrome viruses, codon-by-codon with maximum likelihood analysis, z selection, and the divergence time. The simulated results, dN/dS > 1 signify that an entire substitution model tends towards the hypothesis's positive evolution. The effect of transition/transversion proportion, Z-test of selection, and the evolution associated with these respiratory syndromes, are also analyzed. Z-test of selection for neutral and positive evolution indicates lower to positive values of dN-dS (0.012, 0.019) due to multiple substitutions in a short span. Modified Nei-Gojobori (P) statistical technique results also favor multiple substitutions with the transition/transversion rate from 1 to 7. The divergence time analysis also supports the result of dN/dS and imparts substantiating proof of evolution. Results conclude that a positive evolution model, higher dN-dS, and transition/transversion ratio significantly analyzes the evolution trend of severe acute respiratory syndrome coronavirus 2, severe acute respiratory syndrome coronavirus, and Middle East respiratory syndrome coronavirus.

14.
Virus Evol ; 7(1): veab041, 2021 Jan.
Article in English | MEDLINE | ID: covidwho-1243512

ABSTRACT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causes acute, highly transmissible respiratory infection in humans and a wide range of animal species. Its rapid global spread has resulted in a major public health emergency, necessitating commensurately rapid research to improve control strategies. In particular, the ability to effectively retrace transmission chains in outbreaks remains a major challenge, partly due to our limited understanding of the virus' underlying evolutionary dynamics within and between hosts. We used high-throughput sequencing whole-genome data coupled with bottleneck analysis to retrace the pathways of viral transmission in two nosocomial outbreaks that were previously characterised by epidemiological and phylogenetic methods. Additionally, we assessed the mutational landscape, selection pressures, and diversity at the within-host level for both outbreaks. Our findings show evidence of within-host selection and transmission of variants between samples. Both bottleneck and diversity analyses highlight within-host and consensus-level variants shared by putative source-recipient pairs in both outbreaks, suggesting that certain within-host variants in these outbreaks may have been transmitted upon infection rather than arising de novo independently within multiple hosts. Overall, our findings demonstrate the utility of combining within-host diversity and bottleneck estimations for elucidating transmission events in SARS-CoV-2 outbreaks, provide insight into the maintenance of viral genetic diversity, provide a list of candidate targets of positive selection for further investigation, and demonstrate that within-host variants can be transferred between patients. Together these results will help in developing strategies to understand the nature of transmission events and curtail the spread of SARS-CoV-2.

15.
J Med Virol ; 93(4): 2406-2419, 2021 04.
Article in English | MEDLINE | ID: covidwho-1227754

ABSTRACT

The analyses of 2325 severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) genomes revealed 107, 162, and 65 nucleotide substitutions in the coding region of SARS-CoV-2 from the three continents America, Europe, and Asia, respectively. Of these nucleotide substitutions 58, 94, and 37 were nonsynonymous types mostly present in the Nsp2, Nsp3, Spike, and ORF9. A continent-specific phylogram analyses clustered the SARS-CoV-2 in the different group based on the frequency of nucleotide substitutions. Detailed analyses about the continent-specific amino acid changes and their effectiveness by SNAP2 software was investigated. We found 11 common nonsynonymous mutations; among them, two novel effective mutations were identified in ORF9 (S194L and S202N). Intriguingly, ORF9 encodes nucleocapsid phosphoprotein possessing many effective mutations across continents and could be a potential candidate after the spike protein for studying the role of mutation in viral assembly and pathogenesis. Among the two forms of certain frequent mutation, one form is more prevalent in Europe continents (Nsp12:L314, Nsp13:P504, Nsp13:Y541, Spike:G614, and ORF8:L84) while other forms are more prevalent in American (Nsp12:P314, Nsp13:L504, Nsp13:C541, Spike:D614, and ORF8:L84) and Asian continents (Spike:D614), indicating the spatial and temporal dynamics of SARS-CoV-2. We identified highly conserved 38 regions and among these regions, 11 siRNAs were predicted on stringent criteria that can be used to suppress the expression of viral genes and the corresponding reduction of human viral infections. The present investigation provides information on different mutations and will pave the way for differentiating strains based on virulence and their use in the development of better antiviral therapy.


Subject(s)
COVID-19/virology , Mutation , SARS-CoV-2/genetics , Antiviral Agents/pharmacology , Asia/epidemiology , COVID-19/epidemiology , Coronavirus Nucleocapsid Proteins/genetics , Coronavirus Papain-Like Proteases/genetics , Europe/epidemiology , Gene Silencing , Genes, Viral , Genome, Viral , Humans , Open Reading Frames , Phosphoproteins/genetics , Phylogeny , RNA, Small Interfering/genetics , SARS-CoV-2/classification , SARS-CoV-2/drug effects , Viral Nonstructural Proteins/genetics , Viral Proteins/genetics , COVID-19 Drug Treatment
16.
J Med Virol ; 93(4): 2115-2131, 2021 04.
Article in English | MEDLINE | ID: covidwho-1217370

ABSTRACT

The global outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) urgently requires an effective vaccine for prevention. In this study, 66 epitopes containing pentapeptides of SARS-CoV-2 spike protein in the IEDB database were compared with the amino acid sequence of SARS-CoV-2 spike protein, and 66 potentially immune-related peptides of SARS-CoV-2 spike protein were obtained. Based on the single-nucleotide polymorphisms analysis of spike protein of 1218 SARS-CoV-2 isolates, 52 easily mutated sites were identified and used for vaccine epitope screening. The best vaccine candidate epitopes in the 66 peptides of SARS-CoV-2 spike protein were screened out through mutation and immunoinformatics analysis. The best candidate epitopes were connected by different linkers in silico to obtain vaccine candidate sequences. The results showed that 16 epitopes were relatively conservative, immunological, nontoxic, and nonallergenic, could induce the secretion of cytokines, and were more likely to be exposed on the surface of the spike protein. They were both B- and T-cell epitopes, and could recognize a certain number of HLA molecules and had high coverage rates in different populations. Moreover, epitopes 897-913 were predicted to have possible cross-immunoprotection for SARS-CoV and SARS-CoV-2. The results of vaccine candidate sequences screening suggested that sequences (without linker, with linker GGGSGGG, EAAAK, GPGPG, and KK, respectively) were the best. The proteins translated by these sequences were relatively stable, with a high antigenic index and good biological activity. Our study provided vaccine candidate epitopes and sequences for the research of the SARS-CoV-2 vaccine.


Subject(s)
COVID-19 Vaccines/immunology , COVID-19/virology , Epitopes, B-Lymphocyte/immunology , Epitopes, T-Lymphocyte/immunology , SARS-CoV-2/immunology , Spike Glycoprotein, Coronavirus/immunology , Amino Acid Sequence , Computational Biology , Humans , Immunogenicity, Vaccine
17.
Virus Res ; 298: 198401, 2021 06.
Article in English | MEDLINE | ID: covidwho-1157779

ABSTRACT

Since the onslaught of SARS-CoV-2, the research community has been searching for a vaccine to fight against this virus. However, during this period, the virus has mutated to adapt to the different environmental conditions in the world and made the task of vaccine design more challenging. In this situation, the identification of virus strains is very much timely and important task. We have performed genome-wide analysis of 10664 SARS-CoV-2 genomes of 73 countries to identify and prepare a Single Nucleotide Polymorphism (SNP) dataset of SARS-CoV-2. Thereafter, with the use of this SNP data, the advantage of hierarchical clustering is taken care of in such a way so that Average Linkage and Complete Linkage with Jaccard and Hamming distance functions are applied separately in order to identify the virus strains as clusters present in the SNP data. In this regard, the consensus of both the clustering results are also considered while Silhouette index is used as a cluster validity index to measure the goodness of the clusters as well to determine the number of clusters or virus strains. As a result, we have identified five major clusters or virus strains present worldwide. Apart from quantitative measures, these clusters are also visualized using Visual Assessment of Tendency (VAT) plot. The evolution of these clusters are also shown. Furthermore, top 10 signature SNPs are identified in each cluster and the non-synonymous signature SNPs are visualised in the respective protein structures. Also, the sequence and structural homology-based prediction along with the protein structural stability of these non-synonymous signature SNPs are reported in order to judge the characteristics of the identified clusters. As a consequence, T85I, Q57H and R203M in NSP2, ORF3a and Nucleocapsid respectively are found to be responsible for Cluster 1 as they are damaging and unstable non-synonymous signature SNPs. Similarly, F506L and S507C in Exon are responsible for both Clusters 3 and 4 while Clusters 2 and 5 do not exhibit such behaviour due to the absence of any non-synonymous signature SNPs. In addition to all these, the code, SNP dataset, 10664 labelled SARS-CoV-2 strains and additional results as supplementary are provided through our website for further use.


Subject(s)
COVID-19/virology , Genome, Viral , Polymorphism, Single Nucleotide , SARS-CoV-2/classification , SARS-CoV-2/genetics , COVID-19/epidemiology , Databases, Nucleic Acid , Evolution, Molecular , Humans , Mutation , Pandemics , Sequence Alignment
18.
Pathogens ; 10(2)2021 Feb 09.
Article in English | MEDLINE | ID: covidwho-1134211

ABSTRACT

In December 2019, the first cases of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) were identified in the city of Wuhan, China. Since then, it has spread worldwide with new mutations being reported. The aim of the present study was to monitor the changes in genetic diversity and track non-synonymous substitutions (dN) that could be implicated in the fitness of SARS-CoV-2 and its spread in different regions between December 2019 and November 2020. We analyzed 2213 complete genomes from six geographical regions worldwide, which were downloaded from GenBank and GISAID databases. Although SARS-CoV-2 presented low genetic diversity, there has been an increase over time, with the presence of several hotspot mutations throughout its genome. We identified seven frequent mutations that resulted in dN substitutions. Two of them, C14408T>P323L and A23403G>D614G, located in the nsp12 and Spike protein, respectively, emerged early in the pandemic and showed a considerable increase in frequency over time. Two other mutations, A1163T>I120F in nsp2 and G22992A>S477N in the Spike protein, emerged recently and have spread in Oceania and Europe. There were associations of P323L, D614G, R203K and G204R substitutions with disease severity. Continuous molecular surveillance of SARS-CoV-2 will be necessary to detect and describe the transmission dynamics of new variants of the virus with clinical relevance. This information is important to improve programs to control the virus.

19.
Viruses ; 13(1)2021 Jan 19.
Article in English | MEDLINE | ID: covidwho-1060287

ABSTRACT

Since the identification of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) as the etiological agent of the current COVID-19 pandemic, a rapid and massive effort has been made to obtain the genomic sequences of this virus to monitor (in near real time) the phylodynamic and diversity of this new pathogen. However, less attention has been given to the assessment of intra-host diversity. RNA viruses such as SARS-CoV-2 inhabit the host as a population of variants called quasispecies. We studied the quasispecies diversity in four of the main SARS-CoV-2 genes (ORF1a, ORF1b, S and N genes), using a dataset consisting of 210 next-generation sequencing (NGS) samples collected between January and early April of 2020 in the State of Victoria, Australia. We found evidence of quasispecies diversity in 68% of the samples, 76% of which was nonsynonymous variants with a higher density in the spike (S) glycoprotein and ORF1a genes. About one-third of the nonsynonymous intra-host variants were shared among the samples, suggesting host-to-host transmission. Quasispecies diversity changed over time. Phylogenetic analysis showed that some of the intra-host single-nucleotide variants (iSNVs) were restricted to specific lineages, highlighting their potential importance in the epidemiology of this virus. A greater effort must be made to determine the magnitude of the genetic bottleneck during transmission and the epidemiological and/or evolutionary factors that may play a role in the changes in the diversity of quasispecies over time.


Subject(s)
Coronavirus Nucleocapsid Proteins/genetics , Genome, Viral/genetics , Quasispecies/genetics , SARS-CoV-2/genetics , Spike Glycoprotein, Coronavirus/genetics , Viral Proteins/genetics , Australia , Base Sequence , COVID-19/virology , Genetic Variation , High-Throughput Nucleotide Sequencing , Phylogeny , Polyproteins/genetics , Sequence Analysis, RNA , Victoria
20.
Gene Rep ; 23: 101024, 2021 Jun.
Article in English | MEDLINE | ID: covidwho-1033329

ABSTRACT

SARS-CoV-2, the causal agent of COVID 19, is a new human pathogen that appeared in Wuhan, late December 2019. SARS-CoV-2 is a positive sense RNA virus, having four structural and six accessory proteins including that encoded by ORF8 gene known to be one of the most hypervariable and rapidly evolving genes. Thus, global characterization of mutations in this gene is important for pathogenicity and diagnostics. 240 different nonsynonymous mutations and 2 deletions were identified in 45,400 ORF8 nucleotide sequences during six months pandemic with about half of these variants were deleterious for ORF8, and the quarter of them were located in conserved amino acids. Genetic diversity analysis showed two main regions that harbor L84S and S24L. L84S is by far the most predominant mutation, followed by S24L that appeared first in USA. Phylogenetic analysis of ORF8 variants revealed the appearance of small clades with that of L84S being closer to bats. This is the first study that revealed the global nonsynonymous mutations in ORF8 from January to June 2020.

SELECTION OF CITATIONS
SEARCH DETAIL